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The  Gauss-Tchebyshev  Inequality 
for  ttiiaodal  Distributions 


by 

S.  W.  Dharmadhikari  and  Kumar  Joag-dev 


Summary 


Let  X  be  a  random  variable  whose  distribution  is  unimodal  with  mean  u. 
For  r  >  0,  let  Ar  *  (E|x  -  v|T}^r.  In  this  paper,  we  determine  a  value  ky 
such  that 

P( I X  -  u  |  s  kip  s  [r/(r  ♦  l)]r.k‘r, 

for  all  k  2  kr>  This  improves  and  extends  a  recent  result  of  Vysochanskii 
and  Petunin  (1979)  who  have  only  considered  the  case  r  «  2  with  a  higher 
value  for  k2<  Our  proof  is  also  considerably  simpler  because  it  uses  the 
convex  structure  of  the  class  of  unimodal  distributions. 


AMS  1980  Subject  Classification:  60  E  15 

Key  Words  and  Phrases.  Gauss'  inequality,  Tchebyshev's  inequality, 
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1.  Introduction. 

Let  X  be  a  real  random  variable  with  mean  p  and  let  r  >  0.  Markov's 
inequality  states  that,  for  every  given  a  and  every  k  >  0, 

(1.1)  PCI X  -  a|  a  k)  s  £(|X  -  a|r)/kr. 

If  a  ■  v  and  r  *  2,  (1.1)  reduces  to  the  usual  Tchebyshev  inequality.  Suppose 
now  that  the  distribution  of  X  is  unimodal  with  a  mode  M.  A  result  attributed 
to  Gauss  (1821)  states  that 

(1.2)  P(|X  -  Ml  2  k)  £  (4/9)  E  (| X  -  M|2)/k2, 

for  all  k  >  0.  In  other  words,  if  a  ■  M,  the  bound  on  the  right  side  of  (1.1) 
can  be  reduced  by  a  factor  (4/9)  when  r  *  2.  As  a  consequence,  if  the  dis¬ 
tribution  of  X  is  both  symmetric  and  unimodal,  then  M  »  y  and  (1.2)  gives 

(1.3)  P(lX  -  ul  ^  k)  S  4o2/(9k2). 

where  o2  «  Var  (X).  Recently,  Vysochamfcii  and  Petunin  (1979)  showed  that 

(1.3)  is  valid  without  the  assumption  of  symmetry  as  long  as  k  2  Z8/3.  In  this 
paper,  we  first  obtain  the  factor  by  which  the  bound  in  (1.1)  can  be  improved 
if  the  distribution  is  unimodal  and  a  »  M.  We  then  show  that  the  improved  bound 
is  valid  even  if  a  ■  y  as  long  as  k  is  suitably  large.  For  r  *  2,  we  need 
k  2  /I573,  which  is  better  than  the  value  /S/S  obtained  by  vysocanskii  and 
Petunin. 

2.  Preliminaries. 

In  this  section  we  give  sons  results  on  certain  convex  sets  of  distributions. 
DEFINITION  2.1.  A  distribution  function  F  is  said  to  be  unimodal  about  a 

«*,  M)  and  concave  on  (M,  ®) . 


mode  M  if  F  is  convex  on  (  - 
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Let  CM  denote  the  set  of  all  distributions  on  R  that  are  uni  nodal  about  M. 

Then  C^M  is  clearly  convex  (under  mixtures) .  It  is  also  closed  under  weak 
convergence;  see  Gnedenko  and  Kolmogorov  (1968) ,  Section  32.  Let  denote 
the  set  of  all  uniform  distributions  on  intervals  with  M  as  one  end  point. 

hull  of  UM<  Another  equivalent  statement  of  this 
result  is  as  follows;  [see  Feller  (1971) ,  p.  158]. 

THEOREM  2.1.  A  random  variable  X  has  a  unimodal  distribution  with  mode 
M  if.  and  only  if.  X  is  distributed  as  M  ♦  UZ,  where  U  is  uniform  on  (0.  1) 
and  U,  Z  are  independent. 

This  theorem  enables  one  to  reduce  many  problems  involving  unimodal 
distributions  to  those  involving  uniform  distributions. 

Let  Oj  denote  the  set  of  all  distributions  on  R  which  have  mean  u  and 
finite  support.  The  following  lemma  is  possibly  known. 


Then  is  the  closed  convex 


LEM4A.  2.1.  Every  distribution  in  is  a  finite  convex  mixture  of  one  or 


iwqpQint 


Proof.  Without  loss  of  generality,  let  ii  ■  0.  Let  P  t  Dfl  and  let  v  be 
the  size  of  the  support  of  P.  The  lemma  holds  if  v  s  2.  Suppose  the  lemma 
holds  for  v  s  n,  where  n  2  2.  Let  Y  be  a  random  variable  with  distribution 
P  and  suppose  Y  takes  exactly  (n  ♦  1)  values.  Since  Y  is  not  degenerate  and 
E(Y)  ■  0,  we  can  find  a  >  0  such  that 


€  ■  P(Y  •  -a)  >  0  and  n  »  P(Y  ■  b)  >0. 

Without  loss  of  generality,  assume  that  a£  2  bn.  Consider  the  two-point 
distribution  PQ  which  puts  mass  a/(a  ♦  b)  at  the  point  b  mass  b/(a  ♦  b)  at 
the  point  (-a).  Then  PQ  has  zero  mean  and 


I 
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(2.1)  P  -  <*P0  ♦  (1  -  o)Pj, 

where  o  ■  n(a  ♦  b)/a.  Note  that  a  PQ  accounts  for  all  the  mass  at  b.  It  is 
clear  that  o  >  0.  Oh  the  other  hand,  since  Y  takes  at  least  3  values,  we 
sust  have  C  ♦  n  <  1.  Therefore 

n(a  ♦  b)  ■  an  ♦  bn  s  an  ♦  a£  ■  a(E  ♦  n)  <  a.  Thus  a  <  1. 

The  quantity  Pj  in  (2.1)  is  a  distribution  which  puts  positive  mass  at  £  n  points, 
since  the  mass  at  b  is  accounted  for  by  aPQ.  By  the  induction  hypothesis,  Pj 
is  expressible  as  a  mixture  of  one  or  two  point  distributions  with  zero  mean. 
Therefore,  by  (2.1),  P  also  can  be  expressed  as  a  mixture  of  the  required  type. 
The  proof  of  the  lemma  is  now  complete. 

The  following  lemma  is  standard. 

LEMA  2.2.  Jgt  r  >  0  and  let  X  be  a  real  random  variable  with  E(|x|r)  <  •. 
Then  we  find  a  sequence  of  random  variables  Xn  such  that  each  Xn  takes  only  a 
finite  number  of  values  and  E(j  X^  -  X|r)  ♦  0.  Moreover,  if  r  Jt  1,  then  we 
can  shoose  the  in  such  a  way  that  E(?^)  ■  E(X)  for  all  n. 

3.  The  Gauss-Tchebyshev  inequality. 

The  Markov  inequality  states  that 

(3.1)  P(|X  -  aj  2  k)  s  E(( X  -  a|r)/kr, 

where  X  is  a  real  random  variable,  a  e  R,  r  >  0  and  k  >  0.  If  a  ■  E(X)  and 
r  *  2,  (3.1)  gives  the  usual  Tchebyshev  inequality.  If  X  has  a  distribution 
which  is  uni modal  about  M,  then  the  bound  on  the  right  side  of  (3.1)  can  be 
reduced  by  a  factor  which  depends  on  r.  This  is  made  precise  by  Theorem  3.1. 
below.  For  the  special  case  r  ■  2,  Theorem  3.1  goes  back  to  Gauss  (1821). 
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THEOREM  3.1.  Let  X  have  a  distribution  which  is  unimodal  about  M.  Than 
for  every  9  >  0  and  avery  k  >  0, 

(3.2)  P(| X  -  M)  a  k)  £  (p-~)r  CE<IX^“  Mlr?  . 

Moreover,  this  bound  is  sharp. 

Proof.  Without  loss  of  generality,  let  M  »  0.  Since  (3.2)  is  trivially  true 
if  E|x|r  ■  »,  we  asstoae  that  E|x|r  <  ®.  Since  X  is  umimodal  about  tero,  by 
Theorem  2.1,  X  has  the  same  distribution  as  UZ,  where  U  is  uniform  on  (0,  1) 
and  U,  Z  are  independent.  Now  E(x|r  »  E(|z|r)/(r  ♦  1) .  Therefore  E|z|r  <  ®. 
Lemma  2.2  shows  that  it  is  sufficient  to  establish  (3.2)  in  the  case  where  Z 
takes  only  a  finite  number  of  values.  Now  the  set  of  distributions  of  Z,  for 
which  (3.2)  is  valid,  is  clearly  convex.  Therefore  we  need  only  consider  the 
case  where  Z  is  degenerate.  Finally,  (3.2)  is  clearly  unaffected  by  a  change 
of  scale.  Therefore  we  may  and  do  assume  that  Z  is  degenerate  at  1,  so  that 
X  has  the  uniform  distribution  on  (0,  1).  In  this  case,  E|x|r  *  l/(r  *  1)  and 


P(|X|  a  k) 


J(  1  - 

I  0  • 


if  0  <  k  s  1 
if  k  a  1. 


Therefore 


krP(  J  xj  a  k) 


f", 


k) 


if  0  <  k  S  1 
if  k  a  l. 


For  fixed  r,  the  last  quantity  becomes  maximum  when  k  *  r/(r  ♦  1) .  The  maximum 
value  is  rr/(r  ♦  l)r  *  l.  Therefore 

krP(|x|  aki  E^r  ' 

which  proves  (3.2).  Further  the  above  calculation  shows  that  the  bound  is 
sharp. 
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The  special  case  r  ■  2  gives  the  Gauss  inequality. 

COROLLARY  S.l.  (Gauss).  If  X  has  a  distribution  which  is  unimodal  about 
M,  then,  for  all  k  >  0, 

P(|X  -  M|  2  k)  s  4  E( | X  -  m| 2) / (9k2) 

COROLLARY  3.t.  Let  X  have  a  symmetric  and  unimodal  distribution.  Let 
2 

U  *  E(X)  and  or  ■  Var(X) .  Then,  for  all  k  >  0, 

(3.3)  P(|X  -  u|  2  ko)  S  4/(9k2). 

Proof.  Immediate  from  corollary  3.1,  because  M  *  u. 

Recently, Vysochanskii  and  Petunin  (1979)  showed  that  (3.3)  holds  for 
uninodal  random  variables  without  the  assumption  of  symmetry  provided  that 
k  2:  /($/$) .  We  improve  and  generalize  their  results  below  (Theorem  3.2). 

Our  proof  is  also  considerably  simpler  because  we  use  the  convex  structures 
introduced  in  Section  2. 


T 


r 


THEOREM  3.2.  Let  X  have  a  unimodal  distribution  with  mean  u- 
»  E(|x  -  p|r).  Then,  for  every  k  >  0, 


P( |  X  -  y|  2  k)  &  max 


Let 


Proof.  Without  loss  of  generality  assume  that  v  ■  0.  Suppose  X  is 
unimodal  about  M.  If  0  is  also  a  mode  of  X,  then  the  theorem  follows  from 

Theorem  3.1.  So,  suppose  that  X  is  not  unimodal  about  0.  Again,  we  nay  assume 

* 

that  M  >  0.  By  Theorem  2.1,  X  has  the  same  distribution  as  M  ♦  UZ,  where 
U  is  uniform  on  (0,  1)  and  U,  Z  are  independent.  Now  0  ■  E(X)  ■  M  ♦  |e(Z). 
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Therefore  E(Z)  ■  -2M.  It  is  clear  from  Leona  2.2  that  it  is  sufficient  to 
prove  the  theorem  in  the  case  where  Z  takes  only  a  finite  number  of  values. 
Moreover,  since  the  nean  of  X  is  fixed  at  0,  the  class  of  distributions  of  X 
for  which  the  theorem  holds  is  convex.  Therefore  the  second  assertion  of 
Lemma  2.2  and  Lemma  2.1  show  that  it  is  sufficient  to  prove  the  theorem  in  the 
case  where  Z  takes  exactly  two  values.  We  have  thus  reduced  our  problem  to 
the  case  where  X  has  the  density  f  given  by 


o  ,  if  -a  <  x  <  b, 
6  ,  if  b  <  x  <  c. 


elsewhere 


Here  a,  b,  c  are  suitable  positive  constants.  A  graph  of  f  is  given  in  Pig.  1. 
Since  f  is  not  to  be  unimodal  about  0,  we  must  have  a  <  B.  Further  the 
condition  E(X)  «  0  requires  that  b  <  c  <  a.  Three  cases  arise. 

Case  1.  Suppose  0  <  k  <  b.  Here  P[|x|  <  kj  »  2otk  and  so 


k  P[  X  <  k] 


<3'4)  |«{«  |t|If(t)dt- 


Case  2.  Suppose  b  <  k  <  c.  Here 


P[|xJ  <  k]  -  a(b  ♦  k)  ♦  S(k  -  b), 


_  r*l  .  r*l.  an.r+1  .  r*l, 

k  !t|  f(t)dt  -  - k  - — - 2 


Since  a  <  6  end  0  <  b  <  k,  the  right  side  of  (3.5)  is  positive. 
Consider  the  two  cases  together.  That  is,  let  0  <  k  <  c. 


Then  (3.4)  and  (3.S)  show  that 

C3'6)  |t{<k  |tjTf(t)dt  *  "Trill - 


Now 


t  -  E|x|r  -  /  |t|rf(t)dt  ♦  /  |t|Tf(t)dt 

|t|^  |t|<k 


*  krP[|x|2k]  ♦  ,  [using  (3.6)]. 


Writing  P[|x|  <  k]  •  1  -  P(| x|  2  k],  we  get 


Tr  *  k*  *il*l  *  kl  * 


Therefore 

Cr+l)t  -kr 

(3.7)  P[|X|  *  k]  s  - - ^ - 

rkr 


Case  3.  Suppose  that  c  <  k.  Define  a  new  density  g  as  follows 


g(x)  - 


/  t  .  If0<x<c. 
I  f(x)  ,  elsewhere 
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Since  g  agrees  with  f  outside  the  interval  (0,  c),  the  constant  y  oust  satisfy 
(3.8)  yc  ■  Jq  f(t)dt  a  ob  ♦  8(c-b). 

tt 

Now  let  6  »  /  |t|rg(t)dt.  Then 

7  «fl» 

(r+l)  (T  -6J  -  (r*l)  [/Vf(t)dt  -  ;ctrg(t)dt] 

0  0 

a  obr+1  ♦  B(cr+1-br**)  -  ycr+1 

a  obr+1  ♦  8(cr+1-br*1)  -  cr[ob+B(c-b)] ,  (using  (3.8)] 

-  b(B-a)  (cr  -  bT). 


Since  a  <  8  and  0  <  b  <  c,  we  see  that  6  St.  Let  Y  be  a  random  variable 

r  r 

with  density  g.  Since  g  is  unimodal  about  0,  Theorem  3.1  shows  that 


a)  s  . 

^  K 


r  .r  Tr 


But  since  k  >  c,  the  densities  g  and  f  agree  on  the  set 
(-"»  -k]  u  [k,  «*).  Therefore 

(3.9)  p[|  x|  a  k]  -  p[|y|  a  k]  s  (JL)r  i  . 

The  theorem  now  follows  from  (3.7)  and  (3.9). 


COROLLARY  3.3.  Let  X  be  a  unimodal  random  variable  with  mean  y.  Let 

■  {E(|x-y}r)}^r.  Then,  for  every  k  >  0, 

(  (r*l)  -kr  ~\ 

PtlX-nl  .  Ur]  i  max  ^  rkr  ,  [jpfjp]  j  . 
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Proof.  Immediate  from  Theorem  3.2,  if  we  replace  k  by  kA^  and  note  X*  «  tr> 
Observe  that 

—  s  — )r  whenever  k  a  k  ,  where 

r  'r+l'  r* 

(3.10)  k  -  HkillLl. 

r  ■*-  (r+1) 

Therefore,  the  following  corollary  is  iamediate. 

COROLLARY  3.4.  With  the  same  notation  as  in  Corollary  3.3, 

P( |  X  -  u|  *  kxp  £  (^)r  k'r  , 

for  all  k  2  kf,  where  kr  is  given  by  (3.10) . 

For  a  comparison  of  our  results  with  those  given  by  Vy»acha#akii  ttd 
Petunin,  we  write  the  special  cases  of  the  last  two  corollaries  when  r  •  2. 

COROLLARY  3.5.  Let  X  be  a  unimodal  random  variable  with  mean  p  and 
,  2 

variance  a  .  Then,  for  every  k  >  0, 

(3.11)  P( | X  -  p|  i  ko)  £  max 

Consequently,  for  every  k  2  /IsT/S, 

(3.12)  P(j  X  -  pi  *  k o)  £  -iy  . 

9k‘ 

Proof.  We  only  need  to  note  that  k2  *  /19/3. 

REMARK.  The  inequality  (3.11)  is  an  improvement  of  the  result  of  Vysouhanskii 

2  2 

and  Petunin  (1979).  They  have  (4-k  )/3  in  place  of  our  3-k  /2.  Consequently, 
they  prove  (3.12)  for  all  k  2  /i/3. 
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lt  is  to  be  noted  that  (3.12)  does  not  hold  for  all  k  >  0,  if  the 
distribution  is  not  symmetric.  The  following  detailed  analysis  of  the  exanple 
considered  by  VyaQchflUftii  and  Petunin  shows  that  (3.12)  can  fail  if  k  *  1.385. 
We  note  that  1.385  <  /19/3. 

EXAMPLE  3.1.  Let  a  2  1  and  consider  a  random  variable  X  such  that 
P(X  -  1)  »  (a -  l)/(a  ♦  1) 
and 

P(X  *  x)  «  2(x  ♦  l)/(a  ♦  l)2  ,  -a  <  x  <  1. 

It  is  easy  to  check  that  y  ■  E(X)  ■  0  and 
a2  ■  Var(X)  -  (2a  -  l)/3.  Now 

P(|x|  *  1)  -  -  & 

a  1  (a  ♦  1) 

We  now  set  ko  ■  1.  That  is,  k  *  (l/o).  Then 

k2P(|X  y|  2  ko)  -  o“2  P(|X|  2  1) 

-  3(>  ’  *>  U  *  .  g(a) ,  say 

(2a  -  1)  (a  ♦  ir 

The  condition  g(a)  >  (4/9)  reduces  to. 

(3.13)  8a3  -  15a2  -  54a  ♦  77  <  0. 

Numerical  calculations  show  that  (3.13)  holds  for  1.2816  £  *  £  3.05.  Since 
k  ■  o'1,  we  see  that  (3.12)  can  fail  if  .767  £  k  £  1.385. 


■■  %a. . 

(a  ♦  ir 


-12- 


REFERENCES 


[1]  Feller,  W.  (1971).  An  Introduction  to  Probability  Theory  and  It» 

Applications .  Vol.  It  (2nd  Ed.).  Wiley,  New  York. 

[2]  Gauss,  C.  F.  (1821). 

[3]  Gnedenko,  B.  V.  and  Kolmogorov,  A.  N.  (1968) .  Limit  Distributions 

for  Sums  of  Independent  Random  Variables.  Revised  Edition. 
Addison-Wesley. 

[4]  Vysochansk.il,  o.  F.  and  Petunin,  Ju.  X.  (1979).  Justification  of  the  So 

rule  for  unimodal  distributions.  Theor.  Probability  and  Hath. 
Statist.  21,  8S-S6. 


